Syllable weight encodes mostly the same information for English word segmentation as dictionary stress
نویسندگان
چکیده
Stress is a useful cue for English word segmentation. A wide range of computational models have found that stress cues enable a 2-10% improvement in segmentation accuracy, depending on the kind of model, by using input that has been annotated with stress using a pronouncing dictionary. However, stress is neither invariably produced nor unambiguously identifiable in real speech. Heavy syllables, i.e. those with long vowels or syllable codas, attract stress in English. We devise Adaptor Grammar word segmentation models that exploit either stress, or syllable weight, or both, and evaluate the utility of syllable weight as a cue to word boundaries. Our results suggest that syllable weight encodes largely the same information for word segmentation in English that annotated dictionary stress does.
منابع مشابه
Word segmentation in Persian continuous speech using F0 contour
Word segmentation in continuous speech is a complex cognitive process. Previous research on spoken word segmentation has revealed that in fixed-stress languages, listeners use acoustic cues to stress to de-segment speech into words. It has been further assumed that stress in non-final or non-initial position hinders the demarcative function of this prosodic factor. In Persian, stress is retract...
متن کاملAutomatic segmentation of English words using phonotactic and syllable information
It is difficult to demonstrate the effectiveness of prosodic features in automatic word recognition. Recently, we applied the suprasegmental concept and proposed an extra layer of acoustic modeling with syllables. Nevertheless, there is a mismatch between the syllable and the word units and that makes subsequent steps after acoustic modeling difficult. In this study, we explore English word seg...
متن کاملProbabilistic grammar and the Portuguese Stress Corpus
This paper proposes a weight-based probabilistic approach to stress in Portuguese. Previous analyses have argued that weight-sensitivity in the language is categorical and restricted to word-final syllables. I show that weight effects in Portuguese are gradient and can be found across all three positions in the stress domain (three-syllable window). I also compare two domains of weight computat...
متن کاملLearning to Learn: Infants’ Acquisition of Stress-Based Strategies for Word Segmentation
A majority of English words are stressed on their first syllable. Infants use stress as a cue to word segmentation, but it is unclear how infants discover the correlation between stress and word boundaries. We exposed English-learning infants to a list of words stressed on their second syllable to discover whether infants can learn a new relation between stress and word boundaries. English-lear...
متن کاملCollocation and Thai Word Segmentation
This paper presents another approach of Thai word segmentation, which is composed of two processes : syllable segmentation and syllable merging. Syllable segmentation is done on the basis of trigram statistics. Syllable merging is done on the basis of collocation between syllables. We argue that many of word segmentation ambiguities can be resolved at the level of syllable segmentation. Since a...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2014